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Abstract. The Aw-Rascle-Zhang (ARZ) model can be interpreted as a generalization of 
the first order Lighthill-Whitham-Richards (LWR) model, possessing a family of fundamental 
diagram curves, rather than a single one. We investigate to which extent this generalization 
increases the predictive accuracy of the models. To that end, a systematic comparison of 
two types of data-fitted LWR models and their second order ARZ counterparts is conducted, 
via a version of the three-detector problem test. The parameter functions of the models 
are constructed using historic fundamental diagram data. The model comparisons are then 
carried out using time-dependent data, of two very different types: vehicle trajectory data, 
and single-loop sensor data. The study of these PDE models is carried out in a macro- 
scopic sense, i.e., continuous field quantities are constructed from the discrete data, and 
discretization effects are kept negligibly small. 



1. Introduction 

In this paper the accuracy of various macroscopic traffic models is investigated, by com- 
paring the model predictions with measurement data. Specifically, we investigate to which 
extent second order traffic models (here: the Aw-Rascle-Zhang (ARZ) model [2j[60]) improve 
the accuracy of the predictions compared to the "first order" Lighthill-Whitham-Richards 
(LWR) model |344 HB] . This question is natural, because the ARZ model can be interpreted 
as a generalization of the LWR model (see Sect. [4]). 

The comparison is carried out via a variant of the three-detector problem |1U| : on a segment 
of a highway, on which the traffic state is known on both ends (at all times), a traffic model 
is used to simulate the evolution of the traffic state on the full segment. The thus predicted 
traffic state is then compared with the measured traffic state at certain positions in the interior 
of the segment. In order to ensure that the results are not specific to one particular type of 
measurement/aggregation, we conduct said comparison using two very different data sets: 
vehicle trajectory data, using the NGSIM data set [52], and single- loop sensor data, using the 
RTMC data set [12], provided by the Minnesota Department of Transportation (Mn/DOT). 
These data sets are described in more detail in Sect. [2[ 

The LWR and the ARZ model are representatives of macroscopic traffic models, i.e., they 
describe the evolution of field quantities that are defined continuously in space and time, such 
as vehicle density, velocity, and flow rate, via partial differential equations (PDE). There is 
a wide variety of alternative ways to describe vehicular traffic flow, such as via microscopic 
models (e.g., [13110]), mesoscopic models (e.g., [201 SSI [29l [21] ) 5 probabilistic models (e.g., [I]), 
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and cellular models (e.g., |38j). Moreover, the LWR and the ARZ model are inviscid models, 
i.e., thin transition zones of strong gradients (such as upstream ends of traffic jams) are 
replaced by shocks, whose evolution is determined by the Rankine-Hugoniot jump conditions 
[TT] . This is in contrast to viscous models (e.g., [STJ EE]) or KdV-type models (e.g., [3T| [30] ) . 
which model the traffic dynamics on multiple length scales through viscosity or dispersion, 
respectively. An overview over many of these types of models is provided in |19j . 

The macroscopic description of traffic flow does not attempt to follow each individual ve- 
hicle. Instead, information is interpreted in an aggregated (over multiple lanes) and averaged 
(over space and time) fashion. As a consequence, small scale noise in the drivers' behavior 
averages out, and a description in terms of deterministic quantities is obtained. Furthermore, 
many field quantities used in macroscopic models are closely linked to information that is 
directly recorded by stationary sensors, such as flow rate (number of vehicles passing through 
a fixed position per unit time) . Finally, macroscopic models are related to other means of de- 
scription: they can be derived as limits of mesoscopic models [321 ES| ; cellular models can be 
derived via their discretization ("cell transmission models") [7J [8] in Eulerian variables; and 
microscopic models can be derived via their discretization in Lagrangian variables. Hence, 
through these relations, the results obtained in this study could be of relevance to other types 
of traffic models as well. 

In this paper, we conduct a comparison of PDE models with data, in a fashion that is as 
close as possible to a macroscopic description. Specifically: 



The discrete measurement data is "transformed" into field quantities that are defined 
continuously in space and time (the specifics being described in Sect. 6.1 for vehicle 
trajectory data, and in Sect. 6.2 for stationary sensor data). 

The numerical scheme to approximate the PDE of the traffic model is solved with a 
spatial and temporal discretization small enough, so that the numerical approximation 
error is negligibly small relative to the errors in the models and the data (see Sect. [5] 
for details on the numerical scheme). 

The deviation of the model output from the data is measured in an L 1 fashion, i.e., 
via continuous integrals over space and/or time (see Sect. 7.1 for the definitions of the 
error metrics). 



The key advantage of this approach is that the PDE models are studied in their most original 
form, and free from discretization effects and parameters, such as cell sizes in cell transmission 
models. 

While the model comparison is conducted via time-dependent data, the model parameter 
functions are determined from historic fundamental diagram data that does not contain any 
temporal information (see Sect. [3]). In this sense, this study is very different from online cali- 
bration techniques, such as for instance Kalman filters [23], which have been quite successfully 
applied in the area of traffic modeling, in the form of nonlinear unscented [36], extended |55| . 
and ensemble [58] Kalman filtering. Here, once the model parameters have been determined, 
the model remains completely fixed, and there are no "black-box" parameters. Instead, all 
parameter functions have a distinct physical interpretation in terms of traffic flow (see Sects. M 
and[4]). That being said, we are not advocating that one performs traffic forecasting only with 
fixed models. Instead, it is our goal to provide some insight into which traffic models could 
qualify as good candidates to be used in combination with online calibration techniques. 
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The comparison results presented in Sect. [7] focus on two versions of the first order LWR 
model (corresponding to two types of flux functions), and their homogeneous ARZ model 
generalizations. These four models are compared in terms of their predictive accuracy with 
respect to measurement data. In order to guarantee a meaningful test of the models, it is 
ensured that in both the trajectory and the sensor data sets, a wide range of congestion level 
is present. 



2. Description of Data Sets 

For this study, we use two very different sets of data. First, we consider an NGSIM [53] 
data set, collected on April 13, 2005 on a 500 meter segment of the eastbound direction of 
1-80 located in Emeryville, CA. Using video camera technology, the precise trajectories of all 
vehicles in the segment are accessible (with a temporal resolution of 0.1 seconds), in three 
15-minute intervals at the onset and during rush hour: 4:00pm-4:15pm, 5:00pm-5:15pm, and 
5:15pm-5:30pm. In addition, historic fundamental diagram data for the segment, obtained 
via loop detectors, is available |52j . 

The freeway segment consists of six lanes, and after 125 meters, an on-ramp feeds further 
vehicles onto the road. The ramp is a nuisance, since in this study we do not attempt to 
consider models for merging traffic. However, we have found that the number of vehicles that 
enter the freeway is very small relative to the main flow, and therefore we simply treat the 
segment as if there were no ramp. The three recorded time- intervals possess increasing levels 
of congestion. 

Second, we use parts of the RTMC data set [12] from the year 2003, provided by the 
Minnesota Department of Transportation (Mn/DOT). This data set is obtained via single 
loop sensors, which measure the traffic volume (number of vehicles passing a sensor, M) and 
occupancy (fraction of time that a vehicle is blocking the sensor, a) at fixed positions on the 
highway, aggregated over intervals of Ai a = 30 seconds. From these quantities, the average 
flow rate Q = jV/i a and the average density p = oft at the sensor position are calculated, 
where t is the average vehicle length, here set to £ = 5m. In the case of multiple lanes (each 
lane has its own sensor), the resulting values of p and Q are added up. The average velocity is 
then obtained as u = Q/p- Note that significant inhomogeneities in vehicle length (e.g., due 
to patches of trucks) might lead to inaccuracies in the reconstructions for p and u (patches of 
trucks would lead to overestimated densities, and underestimated velocities). This common 
weakness of single loop sensors could be overcome by the use of double loop detectors. 

For our investigations in this paper, we consider three successive sensor positions (we call 
them sensors 1, 2, and 3) along the southbound direction of I-35W, south of its intersection 
with 1-94. The highway segment between sensors 1 and 3 has a length of 1.224 km, consists 
of four lanes, and is free from ramps. For our model comparison, we use the sensor mea- 
surement data on 74 week days (Monday-Friday) in the time from January 1 st through April 
16th, 2003. On each day, we consider the hour between 4pm and 5pm, during which the 
development of rush hour traffic falls. In addition, to generate the model functions, we use 
historic fundamental diagram data from sensor 2, recorded over the whole year. 

In both types of data sets, various levels of traffic congestion occur during the test intervals. 
The study of congested traffic states is important to create a sufficiently challenging, and 
thus meaningful, test for the quality of traffic models. For both types of data, the historic 
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fundamental diagram data can be used directly to create the model functions and parameters, 
as described in Sects. [3] and |4j In contrast, to allow a study of macroscopic traffic models 
via a three-detector problem setup, macroscopic field quantities must be constructed that are 
defined continuously in time, and possibly in space as well. The creation of the macroscopic 
field quantities from the semi-discrete trajectory data or aggregated sensor data is described 
in Sect. [(J 



3. Data-Fitted Fundamental Diagram Curves 

The fundamental diagram (FD) of traffic flow is a basic tool in traffic engineering to un- 
derstand the flow capacity of a roadway. It is commonly constructed by plotting pairs (p, Q) 
as data points in a flow rate vs. density diagram, in which these data points are obtained 
via long term sensor measurements on the road. The flow rate Q is the number of vehicles 
passing a specific position on the road per unit time, and the density p is the average number 
of vehicles per unit length. Fig. [T] shows a typical example of such a cloud of data points, here 
taken at "sensor 2" (see Sect. [2]) in the RTMC data set. One can quite distinctly see an almost 
perfect functional relationship between p and Q for small densities, as well as a significant 
spread for larger densities. Both features are typical for FDs on highways worldwide, and are 
commonly classified in terms of traffic phases: free flow for low densities; and congested flow 
for medium to large densities (see, e.g., |19l I54j ). or synchronized flow for medium densities, 
and wide moving jams for high densities (see Kerner [26 j ) . 

Early attempts to describe the FD of traffic flow ignored the spread in the congested flow 
regime, and described traffic by a single "traffic flux" function Q{p). A particularly simple 
choice is the Greenshields flux [18] 

Q(p) = /91i m ax(l - p/Pmax) , (1) 

whose quadratic form was based on Greenshields's first FD measurements conducted in the 
1930s. Subsequent measurements |561 115j in the 1950s illuminated that the Greenshields flux 
is not an accurate description of true FD data, and as a reaction, many other types of traffic 
flux function have been proposed and studied (see for instance [2H [151 EH HD El [55]). 

The LWR model (see Sect. [4]) is defined precisely through the choice of a flux function 
Q(p)- A natural way to construct such a single- valued function from a measurement FD is to 
propose a particular functional form, and then determine the best fitting representative from 
the proposed class via a least squares fit to the data. Clearly, such an approach can never 
represent the spread of the data in the congested flow regime. However, several of the key 
features of the data FD can be captured. In this paper, we consider two types of functions 
Q{p)'- a three-parameter family of functions that can be interpreted as smoothed and strictly 
concave versions of the piecewise linear Daganzo-Newell flux function [41] [7] ; and a quadratic 
function, as proposed by Greenshields [15] . 

We start with the construction of the former. We consider a three-parameter family of 
functions 

Qa,x, P (p) = « (« + ( b ~ «)^ " Vi + y 2 ) , (2) 

where 

a = y/1 + (Ap) 2 , b = v / l + (A(l-p)) 2 , and y = A - p) . 
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Flow rate curve for LWR model 



Flow rate curve for LWRQ model 




density p 



Figure 1. Two possible flow rate vs. density curve Q(p), fitted to the historic fundamental diagram 
data. Left: LWR model, based on the three-parameter function Right: LWRQ model, using the 
Greenshields flux ([!]). 



The function Q a ,x,p(p) is strictly concave down, and vanishes at p = and p 
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is strictly decreasing. The three degrees 



over, the induced velocity profile J7 a ,A,p(p) 
(if freedom a, A, and p determine the position and the value of the maximum flow rate, as 
well as the "roundness" of Q near this maximum. Specifically, large values of A result in Q 
being close to the piecewise linear Daganzo-Newell flux function |41|. [7] , while small values of 
A yield a shape close to an inverted parabola. 

Given historic fundamental diagram (FD) measurement data (pj,Qj), i = 1, 
determine a, A, and p by solving the LSQ minimization problem 



, n, we 



mm J2\Q a ,\, P (pj) 
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The resulting function Q(p) lies nicely "in the middle" of the cloud of measurement points in 
the flow rate vs. density diagram (see the left panel of Fig. [IJ). Note that we do not determine 
the stagnation density p max from data, since it is (a) generally insufficiently well represented 
by data, and (b) more or less determined by the physical restrictions of the road and vehicles, 
rather than representing drivers' behavior. We choose 

#lanes 
7.5m 

where the 7.5 meters represent 5 meters average vehicle length, plus 50% safety distance. 

As visible in the left panel of Fig. [IJ the optimal function of the form ([2j resembles the 
general shape of the data FD (except for the spread) quite well. In the following, this function 
shall be the key ingredient in the models that we denote "LWR" and "ARZ". However, 
frequently the application of interest calls for even simpler functions Q(p), for instance because 
rapid function evaluations are required, or because insufficient historic data is accessible. We 
therefore consider also the Greenshields flux Q, which, since p max is fixed, involves only a 
single free parameter: the empty road velocity u max . Since the Greenshields flux does not 
resemble the general shape of the FD data well (because the densities at which the flow rates 
become maximal do not coincide, see the right panel of Fig. [TJ, we do not perform a LSQ 
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fit. Instead, u max is determined from Q, such that the slopes of Q(p) at p = match. The 
Greenshields flux is a quadratic function. Hence, we denote the models that are based on 
it "LWRQ" and "ARZQ" . 



4. First and Second Order Traffic Models 

The basis of macroscopic traffic models is the conservation of vehicles, given by the conti- 
nuity equation 

Pt + (pu) x = . (3) 

Here p(x, t) denotes the density of vehicles, where x is the position along a road, and t is 
time. Here, we consider multiple lanes be aggregated into a single, one-dimensional (plus 
time), density function. The Lighthill-Whitham- Richards (LWR) model [34, 48J is based on 
the assumption of a functional relationship between p and u, defined by the velocity function 
u = U(p), which we assume strictly decreasing with p. This induces a flow rate function 
Q{p) = pU(p), which defines a unique Q vs. p relationship in the associated fundamental 
diagram of traffic flow (see Sect. [3]). With this choice, equation ^ becomes a scalar hyperbolic 
conservation law 

Pt + (Q{p)) x = o , (4) 

and is thus denoted a first order model. The two choices of data-fitted flux functions described 
in Sect. [3j and depicted in Fig. [TJ lead to the models that we denote "LWR" and "LWRQ". 

Second order models add an evolution equation for the velocity field u(x,t) to The 
first second order model, the Payne- Whitham (PW) model [45l [57], does so by introducing 
a "traffic pressure" in analogy to fluid dynamics. Another second order model, proposed to 
overcome certain "non-physical" properties [9] of the PW model, is the Aw- Rascle- Zhang 
(ARZ) model pQIBU]. The homogeneous ARZ model is 

p t + (pu) x = , 

(5) 



(u + h{p)) t + u(u + h{p)) x = , 



where we call h(p) the hesitation function. Note that h(p) is frequently called "pressure", 
and denoted p(p), even though it does not play the role of a pressure in the equations. We 
assume that h'{p) > and h(0) = 0. The inhomogeneous ARZ model [21 US] is obtained by 
adding a relaxation term 

Pt + {pu) x = , 

(u + h{p)) t + u{u + h(p)) x = \(U(p)-u) , (<>) 

implying that drivers adjust their actual velocity to the desired velocity U(p) with a relaxation 
time scale r. The homogeneous ARZ model ^ does not let drivers adjust their driving 
behavior over time (most prominently, there is no mechanism that would let drivers accelerate 
when starting with all vehicles at rest u = 0). In contrast, the inhomogeneous ARZ model 
^ does. Hence, one can expect that §5§ yields satisfactory predictions for fully established 
traffic flow whose nature does not change significantly in time, but that for strongly varying 
traffic conditions, ^ will be a better model. In this study, we restrict to the homogeneous 
ARZ model ([5]); the inhomogeneous ARZ model ^ shall be considered in an upcoming paper 
[121. 
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Figure 2. Flow rate vs. density curves (top row) and corresponding velocity vs. density curves (bot- 
tom row) for the LWRQ, LWR, ARZQ, and ARZ models (from left to right). 



In a loose analogy to fluid dynamics, the conservative form of ([5]) is understood as 

Pt + Qx = , 

ft + = o, (7) 

where the two conserved variables are p and q = p(u + h{p)). 

The ARZ model can be interpreted as a generalization of the LWR model, as follows. 
System ^ can be rewritten as 

Pt + (pu) x = , 

w t + uw x = , (8) 
where u = w — h(p) . 

This means that traffic moves with the velocity field u, and the quantity w = u + h(p) is 
advected with the flow u. Moreover, for p = we have that w = u. Therefore, w can be 
interpreted as a property of each vehicle, namely its empty road velocity. This interpretation 
gives rise to the homogeneous ARZ model consisting of a one-parameter family of velocity 
vs. density curves, namely u w (p) = w — h(p). In turn, the LWR model consists of a single 
velocity vs. density curve, namely u(p) = U(p). With the choice h(p) = (7(0) — U(p), the 
LWR curve is contained in the ARZ family of curves, and therefore the ARZ model can be 
interpreted as a generalization of the LWR model: it models the same velocity decrease with 
increasing density as LWR, but it allows for drivers with different empty road velocities. 

This relationship between the LWR and the ARZ model is depicted in Fig. [2j for flow rate 
vs. density at the top, and for velocity vs. density at the bottom. In each figure, the left two 
panels show the single- valued functions Q(p) and U(p), respectively, for the two first order 
models LWRQ and LWR. The right two panels in each figure then correspond to the second 
order generalizations ARZQ and ARZ: the first order functions Q{p) and U(p) remain as 
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so-called "equilibrium curves" (shown in red). In addition, a one-parameter family of curves 
is added, parameterized by the empty-road velocity w as 

u = U(p) + (w- U(0)) (9) 

and 

Q = Q(p) + p(w-U(0)). (10) 

Through the relationships ^ and (10), the second order models ARZQ and ARZ inherit 
the data-fitted parameter functions from their first order counterparts LWRQ and LWR. It 
is important to stress that the second order models do not contain any further data-fitted 
parameters beyond that. Nevertheless, one can hope that the second order models perform 
better at "predicting" the evolution of traffic flow, for two main reasons: 

• In first order models, drivers adjust instantaneously to changes in density. In contrast, 
the "momentum" equation in second order models can be interpreted as a model for 
acceleration and deceleration. In this sense, second order models might be more 
realistic. 

• At the boundaries of the study segment of road, boundary conditions must be pro- 
vided. Second order models call for more boundary data than first order models, and 
therefore one can "feed" more independent data into the model than with first order 



models (see Sect. 5.2 for more details on how to provide boundary conditions). 



In turn, the families of curves in Fig. [2] illustrate also a shortcoming of the ARZ models, 
namely the fact that the curves u w {p) = w + U (p) — U(0) do not possess a common stagnation 
density /? max - This is clearly unrealistic, since the maximum capacity is a property of the 
roadway, independent of the empty road velocity of a driver. One can therefore expect that 
at extremely high traffic densities, the ARZ model does not perform as well anymore as at 
small and medium densities. The results in Sect. [7] shall reveal whether the advantages of the 
ARZ models outweigh this shortcoming. 



5. Numerical Scheme 

The LWR model Q, as well as the ARZ model ((TJ), can be cast general hyperbolic 
conservation law (system) 

& + = 0, (11) 

for the solution (p(x,t). Here, (ft = p and / = Q for LWR, and (ft = (p,l) an d f(p,q) = 
(q, q 2 /p - h(p)q) for ARZ. 



5.1. Approximation of the PDE. We approximate the solutions of (11) using a standard 
Godunov method [14] . This finite volume method, which is commonly used in traffic flow 
applications (see e.g. [23 H3 [221 EZj), is based on the solution of local Riemann problems, 
which are well established for the LWR model [34J and for the ARZ model [2j . In the numerical 
approach, the road segment is divided into cells of size Ax, and the solution is advanced 
forward in time increments At, which satisfy the CFL condition [6j 
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where s max is the largest wave speed. In each cell [xj — ^f,Xj + -^], the average solution 
value 

I f x j + l/2 

<f)(x, t n ) dx 



1 A 



x 



is approximated. Cell-averages are updated in time by considering the balance of fluxes / 
through the boundaries to adjacent cells. The resulting update rule reads as 

/,n+l jjn. ( pn pn j 



^ ^ Ax V i+s J-3> 
where the Godunov flux between cell j and cell j + 1 is defined as 



and <p* (<pL, 4>r) is the solution of the Riemann problem between the states 4>l and (fin, eval- 
uated at x = 0. 

As argued in Sect. JTJ we are interested in studying the predictive accuracy of the "pure" 
PDE traffic models Q and ([7]). To that end, we choose the spatial discretization Ax suf- 
ficiently fine, so that the numerical approximation errors are negligibly small compared to 
the errors in the models and the data (we always choose Ax < 50cm). This is in contrast 
to certain types of cell transmission models, in which the cell size Ax is not small, and thus 
enters as an additional parameter into the model, and its influence on the solution must be 
considered. 



5.2. Boundary Conditions. The hyperbolic conservation law ( |11[ ) describes the evolution 
of solutions in the interior of the space-time domain (x,t) £ [0, L] x [0,T], where L is the 
length of the road segment, and T is the final time up to which the solution is of interest. In 
addition, one must specify an initial condition 0(x,O), and boundary conditions. The aspect 
of boundary conditions for hyperbolic problems is non-trivial, since the amount and type of 
conditions to be specified at each boundary depends on the solution itself (see [32] )• The 
framework of the Godunov method allows an elegant solution to this problem: beyond each 
boundary (e.g., at x = 0), a ghost cell is added (e.g., at [— Ax, 0]), in which the numerical 
solution takes the state of the boundary data. The Riemann solver between the outmost 
interior cell (e.g., [0,Ax]) and the ghost cell will then pick up exactly the right amount of 
data, namely the components that correspond to waves that travel into the domain [0, L\. 

Using the measurement data, we have the density p and the generalized flow rate q available 
at the boundaries for all times. Since the LWR model Q is formulated in p, it uses only 
density boundary data (while ignoring q) , and the boundary density only influences the result 
if the resulting wave is going into the domain [0, L\. For the ARZ model ([7]), one wave is 
always right-going (a contact discontinuity that moves with the vehicle velocity, see [2]), while 
the other wave could go in either direction. Specifically, for sufficiently light traffic (p small, 
and u large), LWR picks up boundary data for p only at x = 0, and nothing at x = L, while 
ARZ picks up boundary data for p and q at x = 0, and nothing at x = L. In turn, for dense 
traffic {p large, and u small), LWR picks up boundary data for p only at x = L, and nothing 
at x = 0, while ARZ picks up some combination of p and u at each of the two boundaries. 
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6. Creation of Macroscopic Field Quantities from Measurement Data 

The numerical approximation of the macroscopic traffic models Q and ([7]) with small 
discretization errors (see Sect. [5]) requires the boundary data to be defined continuously in 
time. The same holds true for the reference data, which is used to compare the model 
predictions to. And similarly, the initial data (if applicable) must be defined continuously 



in space. In Sect. 6.1 we specify how continuous field quantities are constructed from the 



NGSIM trajectory data; and in Sect. 6.2 the treatment of the RTMC sensor data is described. 



6.1. Treatment of NGSIM Trajectory Data. In the NGSIM data set, we are given 
the trajectories of vehicles Xj(t), with a temporal resolution (0.1 seconds) that is essentially 
continuous in time (data at intermediate times is defined via a simple linear interpolation). 
However, at any instance in time, the data of the vehicle positions xi(i), . . . , xw(t), and their 
velocities u\(t), . . . , ujv(t), is discrete. In order to incorporate this data into, and compare it 
with, a PDE model, we must generate functions p(x, t) and u(x, t) that are defined everywhere 
on the road segment. The construction of density functions from discrete samples is an 
important problem is statistics, and a wide variety of approaches exists to address this task. 
As described below, we employ a kernel density estimation (KDE) approach, with a fixed 
Gaussian kernel, and a reflection method to remove boundary effects. 

The specific type of KDE approach used here, also called the Parzen- Rosenblatt window 
method [49, 44J, works as follows. Let N points be given at positions x%, . . . ,xn- This data 
is interpreted as a finite sample of some (unknown) density function p(x). The goal is to 
re-construct a kernel density estimator p{x) from the discrete information that is close to 
p(x) on an interval [a, b] . KDE starts with a comb function 

N 

°(, x ) = ^2 s ( x ~ x j) > 

3=1 

and then defines p as its smoothed version 




where A' is a smoothing kernel. Here, we consider a Gaussian kernel 



K(x) = G h (x) = 




whose bandwidth h is chosen as the smallest value so that equidistant vehicles generate an 
almost constant p for p > |p max - For the NGSIM data sets, this amounts to h = 25 meters. 
More details on the selection of an optimal bandwidth are provided for instance in [H 123] . 

The approach can be generalized to the case in which the points carry additional informa- 
tion ux, . . . ,ii/V) which is sampled from some (unknown) function u(x). An estimator for the 
weighted density q(x) = p(x)u(x) is obtained as 
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Therefore, an estimator of the function u is given by 

We use this approach at each time t on the NGSIM data set as a unified reconstruction 
approach of smooth estimators for density and velocity from vehicle trajectory data. Note 



that the velocity function ( 15 ) is a kernel density estimator, and therefore it is in general not 



an interpolant of the data Uj at positions xj 



The kernel density estimators for density (13) and flow rate dl4| are reasonable only at 



positions for which sufficiently many data points are present on either side. Therefore, for 
finite intervals, a correction is required near the interval boundaries (otherwise, a spurious 
reduction in density near the boundaries would result). We use a version of the reflection 
method described in [25], here described for the left boundary (the right boundary is treated 
analogously). Let the point positions be ordered x\ < ■ ■ ■ < xjy, and let d denote the average 
distance of interior points near x±. Then ghost points x\,...,x* k are added left of x\ via a 
reflection at a* = X\ — f , i.e. 



and the KDE formulas (13) and (14) are applied to the thus augmented data set (left of 



x\, and right of xn). On each side, the number of ghost points k is chosen, such that the 



estimators (13) and flow rate (14) do not "feel" the boundary effects anymore. 



6.2. Treatment of the RTMC Sensor Data. At the sensor positions, we reconstruct 
continuous density, velocity, and flow rate functions from the aggregated data of the respective 
quantities, as follows. For each interval [t n ,t n + Ai a ], we use the averaged quantity to define 
the value of the continuous function at the interval's mid-time, i.e., p(t n + ^Ai a ) = p n . Then, 
using these data points, the function p(t) is defined for all times by using cubic interpolation. 
The procedure for u and Q is analogous. This time-continuous data is then used at sensors 
1 and 3 to provide boundary conditions, and at sensor 2 to perform a comparison with the 
predictions of the model. 

In addition to boundary conditions, an initial state is needed to run a forward simulation 
of a macroscopic traffic model. Unfortunately, a state estimation from stationary sensors that 
are relatively far apart is not possible with a level of accuracy that would be required here, 
i.e., the model vs. data comparison would not be overshadowed by errors due to the state 
estimation. Therefore, we choose an alternative approach: before the comparison between 
traffic model and data is conducted, we run the traffic model through an initialization phase 
(about 5 minutes long), which is started with a uniform, low density traffic state. During this 
initialization phase, as information flows into the domain from the correct boundary data, 
the traffic model "fills" the interval with a reasonable state. 



7. Comparison Results 

In the following, we apply the four data-fitted traffic models LWRQ, LWR, ARZQ, and 
ARZ to the different sets of time-dependent boundary data to generate predictions of the 
evolution of the traffic state, which we then compare with time-dependent reference data in 
the interior of the domain. In Sect. 7.1 we define precise metrics that quantify the difference 
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of a model prediction with measurement data. Then, the results of the comparisons are 



presented, with the NGSIM data in Sect. |7.2[ and the RTMC data in Sect. 7.3 



7.1. Error Metrics for Model Accuracy Quantification. The traffic models produce 
predictions of the traffic state in the interior of the space-time domain [x, t) 6 (0, L) x (0, T). 
In order to quantify the predictive accuracy of a model, we evaluate its "error", i.e., the 
deviation of the predicted state from the state given by measurement data. We use an error 
metric of the following form. First, we assume that one is interested in predicting both traffic 
densities and velocities accurately, and therefore include both quantities into the error metric. 
Given a model output p modcl and u model ; and data p data and ii data at the same position and 
time, we define the scaled difference 

\pfto*(x,t) - (F^fat)] \u data (x,t) -u mode \x,t)\ , 
E(x,t) = J L + J L . (16) 

Pmax 'Umax 

Second, to remove the impact of noise and outliers in the data, we consider averages (in an 
L 1 sense). Specifically, we consider spatial averages 



1 



L 





for the NGSIM data set, and temporal averages 



E x (t) = - j E(x,t)dx (17) 



E\x) = i / E(x,t)dt (18) 



T 



for the RTMC data set (evaluated at X2, the position of sensor 2). As a consequence, for each 
of the three NGSIM data sets we can plot E x (t) as a function of time, and for the RTMC 
data we can plot the value of E t {x2) for each of the 74 considered days. 

Moreover, for each of the NGSIM data sets, we obtain a final error for each model by 
averaging over space and time 

E = YzJ o J E(x,t)dxdt, (19) 

and for the RTMC data, we obtain a fully averaged error for each model by averaging over 
all days 

#days 

E =#^E E ^fe>- < 20 > 

Since the model predictions result from a numerical scheme with a (small) cell size and time 



step, the integrals in (17), (18), and (19) are approximated (with negligible approximation 



error) using simple quadrature rules over the computational space-time grid. 

7.2. Model Comparison using the NGSIM Trajectory Data. The test setup described 
above is applied independently to each of the three NGSIM data sets. There is one caveat 
with the vehicle trajectories, namely that during the first minute, not all vehicles on the 
roadway are recorded. We therefore always start the simulation one minute after the data 
set starts. Hence, the three test cases cover the times 4:01pm-4:15pm, 5:01pm-5:15pm, and 
5:16pm-5:30pm. We first take a deeper look at the time segment 4:01pm-4:15pm. In order 
to obtain an impression of the model error at different times, the temporal evolution of the 
space-averaged errors ( fl~7| ) for the four models is shown in the left panel of Fig. [3} Here, the 
two first order models (LWRQ and LWR) are depicted by thick, solid, light-blue and light-red 
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Time-evolution of error (NGSIM 4:01-4:15 trajectory data) Average errors 




Time of day 

Figure 3. Comparison of traffic models for the 4:01-4:15 NGSIM trajectory data set. Shown is the 
temporal evolution of the errors (left), and the space-time average errors (right), all in a log-scale. 



Data set 


LWRQ 


LWR 


ARZQ 


ARZ 


NGSIM 4:01-4:15 
NGSIM 5:01-5:15 
NGSIM 5:16-5:30 


0.242 (+230%) 
0.255 (+201%) 
0.209 (+78%) 


0.127 (+73%) 
0.115 (+36%) 
0.124 (+6%) 


0.126 (+72%) 
0.149 (+76%) 
0.152 (+30%) 


0.073 
0.085 
0.117 


RTMC congested 
RTMC non-cong. 


0.352 (+110%) 
0.094 (+94%) 


0.167 

0.069 (+43%) 


0.173 (+4%) 
0.048 


0.172 (+3%) 
0.066 (+37%) 



Table 1. Comparison of traffic models for NGSIM data sets (rows 2-4) and RTMC data, separated 
into congested and non-congested days (rows 5-6). Given are the space-time averaged errors (19) for 
the NGSIM trajectory data, and time-and-day averaged errors (20) for the RTMC sensor data. In 



parentheses shown are the relative excess errors of the models with respect to the most accurate model 
(for each particular test). 



curves, respectively, and their corresponding second order generalizations (ARZQ and ARZ) 
are shown by thin, dashed, dark-blue and thin dark-red curves, respectively. Moreover, the 
according space-time-averaged errors (19) are shown in the right panel. 

The temporal evolution of the space-averaged errors shows a couple of interesting aspects. 
One can see that second order models tend to exhibit more oscillatory errors than first order 
models. This follows from the theory of scalar vs. systems of hyperbolic conservation laws: in 
first order models, temporal oscillations in the boundary data are turned into shocks, as they 
are transported into the computational domain, that then decay towards an uniform profile. 
In contrast, in the second order ARZQ and ARZ models, temporal oscillations of the data at 
the left boundary can turn into waves that are transported into the domain without decaying 
towards a uniform profile. These waves correspond to contact discontinuities in the Riemann 
problem (see |2]). 
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Another apparent aspect is that even if one model is significantly closer to the data than 
another model in the space-time averaged sense, this relationship need not hold consistently 
for all times in the space- averaged errors. For instance, the ARZ model yields a significantly 
lower total error than the LWR model. However, at a few short instances in time, the LWR 
solution is closer to the data than the ARZ solution. The same holds true for ARZQ compared 
to LWRQ. Note that such incidental behavior is not unreasonable in highly unsteady traffic 
flow. However, it demonstrates that studies of traffic outside of the free-flow regime require 
sufficient averaging in order to yield meaningful results. 

The main message to be concluded from this test is that the space-time averaged errors 
of the four models speak relatively clearly in favor of (a) the three-parameter data-fitted 
flux function over the Greenshields flux; and (b) second order models over their first order 
counterparts. These fully averaged model errors are also given in the first NGSIM row in 
Table [TJ together with the relative excess error of the LWRQ, LWR, and ARZQ model to 
the (smallest) ARZ model error, shown in parentheses. The second and third NGSIM row in 
Table [T] then show the full model errors (and the relative excess errors) of the four models for 
the NGSIM data sets at the other two time segments (5:01pm-5:15pm and 5:16pm-5:30pm). 
The general trends (a) and (b) described above persist, with one caveat: as the traffic becomes 
more congested (later time segments), the improvements due to better quality FD curves, and 
due to using a second order model, are less pronounced than they are in the transitional time 
segment 4:01pm-4:15pm. The fact that for higher traffic densities, second order models 
become less advantageous over their first order counterparts, is most likely related to the 
unrealistic variable stagnation density that the ARZ model possesses (see Sect. El). 



7.3. Model Comparison using the RTMC Sensor Data. The three-detector problem 
test setup is now applied to the RTMC sensor data, independently for 74 week days, and on 
each of these days for the hour between 4pm and 5pm. An inspection of the average traffic 
densities at each day reveals a relatively clear separation between days at which traffic becomes 
congested, and days at which traffic densities remain moderate. We therefore separate the 
74 days into 43 "congested days", defined as the time-averaged traffic density (aggregated 
over the four lanes) exceeding 80 vehicles per km, and "non-congested days", which are the 
remaining 31 days. 

Figure [4] shows the time-averaged errors (18) for the 43 congested days in the top panels, 
and for the 31 non-congested days in the bottom panels. The day-by-day errors are given in 
the left panels, and the right panels show the errors averaged over the respective list of days. 
In all panels, the line style conventions are the same as in Fig. [3j These fully averaged errors 
are also given in rows 5-6 in Table [TJ as are the excess errors relative to the best model in 
parentheses. 

The results obtained with the RTMC data are not as conclusive as those obtained with the 
NGSIM data. Most likely, the significant noise level, measurement errors, and reconstruction 
errors of the single loop sensors play an important role in this. Still, a variety of conclusions 
can be drawn from the results. The fact that for the Greenshields flux, the second order 
model (ARZQ) yields more accurate results than the first order model (LWRQ), persists. 
In contrast, with the better- fitted three-parameter flux function ([2]), the differences between 
ARZ and LWR are marginal. What can be observed is that there are a few days (e.g., 26 or 
38) for which ARZ is much more inaccurate than LWR, while for the majority of days ARZ 
is more accurate than LWR (by a smaller margin). An inspection of the solutions on day 38 
reveals an explanation for the large error of ARZ: the densities and velocities at the boundaries 




10 20 30 40 50 60 70 

Day 

Error of different days with low density (RTMC sensor data) Average errors 




Figure 4. Comparison of traffic models for the RTMC sensor data on 74 week days in 2003. The 
top panels show 43 days with a high congestion level; the bottom panels show the remaining 31 days 
with low average densities. In each case, the time-averaged errors day- by-day are shown (left), and 
the effective errors, averaged over the considered days (right). All error plots are in log-scale. 



happen to create wave speeds very close to zero, and as a consequence, boundary data that 
is fed into the model does not reach sensor 2 within any reasonable time. A surprising result 
is that for the non-congested days, ARZQ yields smaller errors (factor 1.4) than LWR and 
ARZ. We have no explanation for this last phenomenon. 
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8. Conclusions and Outlook 

We have established a systematic methodology to compare the predictive accuracy of data- 
fitted macroscopic traffic models by using time-dependent data of two different types: vehicle 
trajectories and stationary sensors. The traffic models studied here are first order Lighthill- 
Whitham- Richards (LWR) models, whose flux functions are obtained via a least squares fit 
to historic fundamental diagram (FD) data, and their second order Aw-Rascle-Zhang (ARZ) 
generalizations. The presented approach is a fully macroscopic version of the three-detector 
problem test [10] , in the sense that the discrete data is transformed into continuously defined 
field quantities and that discretization effects are negligible. The following conclusions can 
be drawn from the conducted studies: 

• The use of better data-fitted flux functions tends to lead to more accurate models. The 
only exception from this rule is observed for sensor data in a non-congested regime. 

• Second order models are consistently as good as their first order counterparts, and 
in many cases they are significantly more accurate. This is remarkable, because the 
ARZ model does not contain more data-fitted parameters than the LWR model. A 
key reason for the better quality of second order models is that they allow for more 
independent information be fed into the solution through the boundaries. 

This study also gives rise to a variety of further opportunities and questions that should 
be investigated. Most importantly, the ARZ model's limitation of possessing a non-unique 
maximum density should be remedied. "Generic second order" traffic models, as introduced 
in [33], provide a framework to achieve this goal. We shall consider the predictive accuracy 
of such types of generalized ARZ models in an upcoming paper [12J. 

Another important model extension is to consider inhomogeneous second order models 
by adding a relaxation term. A simple version is the inhomogeneous ARZ model ([6]) with 
h(p) = U(0) — U(p), whose solutions relax towards LWR solutions. However, as pointed out 
in [17\ 113] , second order models with relaxation terms can possess much richer dynamics, if 
one chooses the functions U(p) and h{p) independently of each other. In this case, the models 
can develop instabilities ("phantom traffic jams") and traveling wave solutions ( "stop-and-go 
waves"). Moreover, as shown in [50], traffic waves can explain the observed spread in FD 
data, and therefore this generalization could give rise to a construction of U(p) and h(p) in 
different ways than conducted in this paper. 

An alternative class of models, that are specifically designed to match set-valued FDs, 
are phase-transition models [5j [3]. The incorporation of such models into the comparison 
framework established here would be an interesting aspect for future research. 

A different direction would be the extension of the macroscopic comparison framework to 
other types of data, such as GPS data, as used in the Mobile Millennium project [59J. And 
finally, it would be of interest to incorporate online calibration techniques, such as Kalman 
filters, into some of the traffic models studied here, and to investigate to which extent their 
predictive accuracy can be further improved. 
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